A POINCARE SECTION FOR HOROCYCLE FLOW ON 
THE SPACE OF LATTICES 

J. S. ATHREYA AND Y. CHEUNG 

Abstract. We construct a Poincare section for the horocycle flow on 
the modular surface 5'I/(2, IR)/5'I/(2, Z), and study the associated first 
return map, which coincides with a transformation (the BCZ map) de- 
fined by Boca-CobeH-Zaharescu [8]. We classify ergodic invariant mea- 
sures for this map and prove equidistribution of periodic orbits. As 
corollaries, we obtain results on the average depth of cusp excursions 
and on the distribution of gaps for Farey sequences and slopes of lattice 
vectors. 



1. Introduction 

Let X2 = SL{2, M) /SL{2, Z) be the space of unimodular lattices in E?. X2 
is a non-compact finite- volume (with respect to Haar measure on SL{2,M.)) 
homogeneous space. X2 can also be viewed as the unit-tangent bundle to 
the hyperbolic orbifold ]HI^/S'L(2,Z). The action of various one-parameter 
subgroups of 5L(2,]R) on X2 via left multiplication give several important 
examples of dynamics on homogeneous spaces, and have close links to ge- 
ometry and number theory. 

For example, the action of the subgroup 

yields the geodesic flow on X2, whose orbits are hyperbolic geodesies when 
projected to M"^ / SL{2,Z). This flow can be realized as an suspension flow 
over (the natural extension of) the Gauss map G{x) = {^} , and this con- 
nection can be exploited to give many connections between the theory of 
continued fractions and hyperbolic geometry (see the beautiful articles by 
Series |33j or Arnoux for very elegant expositions). 

In this paper, we study the horocycle flow, that is, the action of the 
subgroup 

N=Us=( ]^ J ' 



The main result (Theorem 1.1 ) of this paper displays the horocycle flow as a 
suspension flow over the BCZ map, which was constructed by Boca-Cobeli- 
Zaharescu [H] in their study of Farey fractions. Equivalently, we construct a 
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transversal to the horocycle flow so that the first return map is the BCZ map. 
This enables us to use well-known ergodic and equidistribution properties 
of the horocycle flow to derive equivalent properties for the BCZ map (in 
particular that it is ergodic, zero entropy, and that periodic orbits equidis- 
tribute), and give a unified explanation of several number-theoretic results 
on the statistical properties of Farey gaps. We also give a 'piecewise-linear' 
description of the cusp excursions of the horocycle flow, and derive several 
new results on the geometry of numbers relating to gaps of slopes of lattice 
vectors. 



1.1. Plan of paper. We describe the organization of the paper, and also 
give a guide for readers. 



1.1.1. Organization. This paper is organized as follows: in the remainder of 
the introduction (Q, we state our results: in { 1.2, we describe the transver- 
sal to the horocycle flow, and state Theorem 1.1 In ^.3, we discuss the 



ergodic properties of the BCZ map ( ^1.3.2 ) and the structure and equidistri- 
bution of periodic orbits (^1.3.3[). A piecewise-linear description of horocycle 



cusp excursions is given m ; jl.4[ a unified approach to Farey statistics is de- 
scribed in j ]1.5t and a similar approach to statistics of gaps in slopes of lattice 
vectors is the subject of {1.6 In f|2| we prove Theorem |1.1[ and show how 
to describe it using Euclidean and hyperbolic geometry. The structure of 
periodic orbits is the subject of Q and in ^ these structure results are used 
to obtain equidistribution properties and the corollaries on Farey statistics 
in |5.1[ f|3] contains the proof of ergodicity and the calculation of entropy of 
the BCZ map. We prove our results on cusp excursions in ^ and in ^we 
prove our results on geometry of numbers. Finally, in ^ we outline some 
questions and directions for future research. 



1.1.2. Readers guide. Since this paper touches on several different topics, 
it can be read in several different ways. We suggest different approaches 
for the ergodic-theoretic and number-theoretic minded readers. We recom- 



mend that the ergodic-theoretic reader start with sections { 1.2 and { 1.3 (and 
perhaps ^1.4), and follow it with ^ fjs] and ^before exploring the more 
number theoretic parts of the paper. The number-theoretic reader should 



also start with i 1.2 and j: 1.3 but then may be more intrigued by the results 
and |1.6| and their proofs in Q^5.1 and ^respectively. 



of a. 3. 3, il.5 



1.2. Description of transversal. Recall that X2 can be explicitly identi- 
fied with the space of unimodular lattices via the identification 



gSL{2,Z) ^ gl? 



Let 
(1.1) 



Pa,b 



a b 




: a e 



,6e 
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denote the group of upper-triangular matrices in SL(2,M.). Let 

(1.2) n := {pa,bSL{2, Z) : a, 6 G (0, 1], a + 6 > 1} C Xg. 

By abuse of notation, we also use to denote the subset 

{(a, 6) G : a, 6 G (0, 1], a 6 > 1} C M^. 

In ^ we will show that can be viewed as the set of lattices with a 
horizontal vector of length at most 1, can also be identified with the sub- 
set {z = X + iy ^ M"^ : \x\ < > 1, \z\ > l} C of the upper-half plane. 
Our main theorem is that the J7 is a Poincare section for the horocycle flow, 
and that the first return map is the BCZ map. We see the space together 
with the roof in Figure [T} 

Figure 1. A picture of the suspension space over fi. Trajec- 
tories of the flow are vertical lines. Starting from (a, b) G 
hg trajectories move vertically until they hit the 'roof (at 
time i?(a, b)) upon which they return to the floor at position 
r(a,6). 




Theorem 1.1. Q C X2 is a Poincare section for the action of N on X2. 
That is, every N -orbit {/isAjsgK (with the exception of the codimension 1 set 
of lattices A with a length < 1 vertical vector), A G X2, intersects Q in a non- 
empty, countable, discrete set of times {sn}nei.- Given Aa^b = Pa,bSL{2,Z), 
the first return time R{a, b) = min{s > : hgAa^b S fi} is given by 

(1.3) ii(a,6) = ^. 
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The first return map T : — )• Q defined implicitly by 

^T{a,b) = hR(a,b)Xa,b 

is given explicitly by the BCZ map 
(1.4) T{a,b)=(b,-a + 



Remarks: 

Short periodic orbits: We will see below that lattices with a length 
< 1 vertical vector consist of those whose horocycle orbits are pe- 
riodic of period at most 1, and are embedded closed horocycles in 
]HI^/S'L(2, Z), foliating the cusp. Thus, from a dynamical point of 
view, there is no loss in missing them. 

Integrability of R: A direct calculation shows that R2dadb = 
which is the volume of X2 when viewed as the unit tangent bundle of 
W^/SL{2, Z) with respect to the measure — . It is also immediate 
that R G U'{dm) for p < 2, where dm = 2dadb. 



1 + a 



1.3. Return map and applications. The BCZ map has proved to be a 
powerful technical tool in studying various statistical properties of Farey 
fractions [5l 13 |T0] , and the distribution of angles of families of hyperbolic 
geodesies [TT| . 



1.3.1. Tiles and images. We briefly describe the basic structure of the piece- 
wise linear decomposition of T. (|1.4|) tells us that the map T acts via 



T{a,b) = {a,b)Al 



where 



1 

-1 k 



l+a I 



on the region := {{a,b) G Q, : K{a,b) = k}, where K{a,b) = [— 5- 
(see Figure [21 below) . $7i is a triangle with vertices at (0,1), (1,1), and 

(i,i);andfor^>2, f], is a quadrilateral with vertices at (l,f), (l,^,), 
(^^,^^, and Note that area(rii) = g, andfor k > 2, 

area(rifc) = fc(fc_|_i)(fc_|_2) = 0{k~^) so that k G U'{m) for 1 < p < 2. 

The image TQ^ is the reflection of 0.^ about the line a = b (see Figure [3]). 
However, T is orientation-preserving, and thus does not act by the reflection. 
For k = 1, T acts by the elliptic matrix Ai, for k = 2 the parabolic matrix 
A2, and for A: > 3, by the hyperbolic matrices, since trace(Afc) = k. 
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Figure 2. The Farey triangle ^^ = |Jfe>i ^fe 
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1.3.2. Ergodic properties of the BCZ map. In [7, §3], Boca-Zaharescu posed 
a series of questions on the ergodic properties of T: 

Question. Is T ergodic (with respect to Lehesgue measure)? Weak mixing? 
What is the entropy ofT? 



A corollary of Theorem 1.1 and the ergodicity and entropy properties of the 
horocycle flow is: 

Theorem 1.2. T is an ergodic, zero-entropy map with respect to the Lebesgue 
probability measure dm = 2dadb. Moreover, m is the unique absolutely con- 
tinuous invariant probability measure, and in fact is the unique ergodic in- 
variant measure not supported on a periodic orbit. 

1.3.3. Periodic orbits. The map T has a rich and intricate structure of pe- 
riodic orbits, closely related to the Farey sequences. A direct calculation 
shows that for Q G N the point Aj_ , = pj_ ^7? is /ig-periodic with (mini- 

mal) period . 

Given Q E N, the Farey sequence J-{Q) is the collection (in increasing 
order) of fractions < ^ < i with q < Q. Let N = N(Q) denote the 

cardinality of Q. We write J-'{Q) = {j = < ^2 = ^ < ■ ■ ■ < In}- For 
notational convenience, we write 7Ar+i = 1 = j. Writing = with 
qi < Q, we have qi + gj+i > Q, and 

ai+m -PiQi+i = 1- 
The following fundamental observation is due to Boca-Cobeli-Zaharescu [8]: 

The indices are v iewed cyclically in TLjNTL. We give a geometric explanation 



for (1.5) in ^4.1 below. Thus, (^,1) is a periodic point of order A'^ for T . 
To relate this to the periodic orbit of A^^ i_ for hg, we first record two simple 
observations. First, 

^ = (— \ 
Q'Qj W' 
and second, 

TV ^ N 

i=i i=i 

Multiplying both sides by Q^, and applying (1.5), we obtain 

^ /I \ 

f^^q^q^+l ^ \Q J 

Thus, as we sum the roof function R over the periodic orbit for T, we obtain 
the length of the associated periodic orbit for the flow. 



A POINCARE SECTION FOR HOROCYCLE FLOW 



7 



We recall Sarnak |32] showed that long periodic orbits for hs (i.e., long 
closed horocycles) become equidstributed with respect to ^2, the Haar mea- 
sure on X2. Our main result on the distribution of periodic orbits is the 
following discrete corollary of Sarnak's result: let pqj denote the proba- 
bility measure supported on (a long piece of) the orbit of (^,1): given 
/ = [a,/3] C [0, 1], let Ni{Q) := \T{Q) n /|, and define 

If / = [0, 1], we write pqj = pq. 

Theorem 1.3. For any (non-empty) interval I = [a, f3] C [0,1], the mea- 
sures Pqj become equidistributed with respect to m as Q ^ 00. That is, 
Pq,i ~^ ^) where convergence is in the weak-* topology. 



Remark. The case / = [0, 1] of Theorem 1.3 was proven in \27\ using different 



methods. Also, Theorem 1.3 can be deduced from Theorem 6 in [28], which 
is proven in the more general context of horospherical flows using a section 
that reduces to the same one in Theorem lUJ 



Theorem 1.4. A point (a, b) £ Q is T -periodic if and only if it has rational 
slope. In particular, the set of periodic points is dense. 

While periodic points are abundant, there are strong restrictions on the 
lengths of periodic orbits, and the associated matrices, governed by the 
relationship between the (discrete) period P(a, b) under T of a point (a, b) 
and the (continuous) period s(a, b) of the associated lattice Pa,b'^'^ under kg. 
We fix notation: for n > 1, we write 

An{a, b)=A {T^-\a, b)) A {T^-\a, b)) . . . A{a, b), 

so 

r"(a,5) = {a,b)An{a,bf. 
As a starting point for our observations, note that the diagonal (a, a) G 
Q consists of fixed (i.e, period P{a,a) = 1) points for T, the associated 
horocycle period is the value roof function R{a, a) = and the associated 
matrix 

Ap(a,a){a, a) = A{a, a) = A2 = 

is parabolic. Our main result on periodic points is that appropriate versions 
of these observations hold for all (segments of) periodic points. 

Theorem 1.5. For any periodic point (a, 6) G 17, we have 

P(a,6) = ivQVK«,b)J). 

Thus the set of possible periods is given by the cardinalities {N{Q) : Q G N} 
of Farey sequences. Moreover, the matrix 

^P(„,,)(a, b) = A (r^("''')-i(a, 6)) A (r^(-''')-2(a, 6)) . . . ^(a, b) 
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associated to the periodic orbit is always parabolic , and in fact constant 



along the segment |(ia,t6) : 
relatively prime, and a G 



t £ 



1 

a+b 



1 



|. Precisely, for k,l £ N, k < 



I 



I 

k 



P { a,a- 



and for a G 

A 



P(a,a 



I 



a, a J 



, 1 < r < k, we have 
Nil + r-1), 



1 - kl 
-k^ 



l + kl 



1.4. Piecewise linear description of horocycles. In this section, we 
describe tlie results that originally motivated this project, on cusp excursions 
and returns to compact sets for the horocycle flow. Let ||.|| denote the 
supremum norm on M^. For A = g7? G X2, define 



and let a : X2 



i{A): 
be given by 

a(A) 



inf llvl 



By Mahler's compactness criterion, a subset A of X2 is precompact if and 
only if there is an e > so that for all A G A, 

1{K) > e, 

or, equivalently if a\A_ is bounded. 

Dani [15j showed that for any lattice A G X2, the orbit {hsA}s>o is either 
closed or uniformly distributed with respect to the Haar probability measure 
fj, on X2- Thus for an A so that {hsA}s>Q is not closed, we have 



and 



limsup ai(/isA) = 00 



limsup£(/isA) = 0. 



In fact, this does not require equidistribution but simply density (due to 
Hedlund [21! )• {^sA} is closed if and only if A has vertical vectors. 

A natural question is to understand the rate at which these excursions to 
the non-compact part ('cusp') of X2 occur. The first-named author, in joint 
work with G. Margulis [1], showed that for any A G X2 without vertical 
vectors, 

log ai (/is A) 
iimsup > 1, 

s-s>oo log S 

and related the precise limit to Diophantine properties of the lattice A (see 
also [2]). 
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Our results concern the average behavior of all visits to the cusp, as 
defined by local minima of the function iA{s) = £(hsA) (or, equivalently, 
local maxima of aA(s) = a{hsA)). The function i\{s) is a piecewise-linear 
function of s, and helps give a picture of the'height' of the horocycle at 
time s. In this way it is similar to work of the second author [T^, where 
a piecewise linear description of diagonal flows on SL{3,M) / SL{3,7j) was 
studied. 

We also consider returns to the compact part of X2, given by local min- 
imal of ^a(s). Given A G X2 so that {/isA}s>o is not closed, let and 
{Sn} denote the sequences of minima and maxima of i\{s) respectively. To 
be completely precise, the local minima for the supremum norm occur in 
intervals, and we take Sn to be the midpoint of the n^^ interval. Define the 
averages 

1 ^ 1 ^ 

n=l 71=1 
N N 

n=l n=l 

Theorem 1.6. For any A without vertical vectors, we have 



ON 



(1.6) 



lim aAr(A) = 2 

Af-5-oo 



(1.7) 

Let M : O [0, 1] be given by 



(1.8) 



lim In[K) = - 



M{a, b) = max < a,b 



a + b 



Theorem 1.7. For any A without vertical vectors, we have 



(1.9) 



(1.10) 



N 



lim An {A) 



—dm = - 
M 3 



13 - 8\/2 ) ^ 1.2 



lim Ln{A) 

N^oo 



Mdm 



(^7-4^/2 



.73 



Theorem 1.6 and Theorem 1.7 follow from applying the ergodic theorem 
to the BCZ transformation T : — )• fi. The assumption that A does not 
have vertical vectors is to ensure it does not have a periodic orbit under 
T (equivalently, hg). There is a version of this result for periodic orbits 



(Corollary 1.12) given in Q.5.5 The function M gives the maximum of the 
function t 
Theorem 



on a sojourn from the transversal The limits (|1.6|) and (|1.7|) in 
are the integrals of the functions g{a, b) 



1.6 



over O respectively. The times 

to n. 



A and h{a, b) = 
correspond to the return times of {/is A} 
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1.5. Farey Statistics. Theorem 1.3 has several number theoretic corollar- 
ies. We record results on spacing and indices of Farey fractions, originally 
proved using analytic methods in [SJ [HI [50] . Our results give a unified expla- 
nation for these equidistribution phenomena. For similar applications in the 
context of higher dimensional generalizations of Farey sequences we refer 
the reader to [3D] and 



1.5.1. Spacings and h-spacings. We now fix an interval / C [0, 1]. It is well 
known that the Farey sequences Fi{Q) become equidistributed in [0, 1) as 
Q — )• oo. A natural statistical question is to understand the distribution of 
the spacings (74+1—74). Since there are Nj{Q) points, and Nj{Q) ~ \I\:^Q^, 
the natural normalization yields the following question: given < c < d, 
what is the limiting (Q — )■ 00) behavior of 

\HeJ^i{Q):^\I\QHj,+i-j.)e[c,d]}\ ^ 
NiiQ) 



Since (5^(7i+i — 7^) = (i? o T* ^) 1 j , the above quantity reduces to 
(1-11) PQ,I (r'' 



7r2 7r2 ■ 



Since R ^ ^r\^' STTTj'^] ) compact, we can directly apply Theorem jl.sj to 
pTTT]) obtain a result of R. R. Hall [20]: 



Corollary 1.8. As Q ^ 00, 

\H e J'liQ) ■■ ^\I\QHl^+l-l^) g ic,d)}\ 

NiiQ) 



m I R 



vr TT 



We call this distribution Hall's distribution. A generalization of this result to 
higher-order spacings was considered by Augustin-Boca-Cobeli-Zaharescu |5j. 
They considered h-spacings: the vector of /i-tuples {h > 1) of spacings 
Vi,/t = ^i,h{Q) = {li+j - H+j-i)j=i e Given B = Y\i=i [ci, di], we have, 
as above, 



(1.12) 

where Rh : ^ 
and 



e :Fi{Q) : ^|/|Q2 v,,fe£g}| / 

=PQ,l[Rh 



NiiQ) 
is given by 



R^ia,b) = {R{p-'{a,b)))'' 



B 



vr 

Wi\ 



n 

i=l 



vr vr 
3 / 3 / 



Applying our equidistribution result to (|1.12 ), we recover Theorem 1.1 of [5j: 
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Figure 4. The distribution function for the gaps 




Corollary 1.9. As Q ^ oo, 

\H € :Fr{Q) : ^Q^vj^f, € B}\ 
NiiQ) 

1.5.2. Density of the limiting distribution. Corohary |1.8| allows one to ex- 
plicitly calculate the density of the limiting distribution of consecutive gaps, 
that is, of Hall's distribution. For d > 0, we define the distribution function 
Gc{d) to be the asymptotic proportion of (normalized) gaps of size at most 
b, that is, 



Gcid) 



lim 



\H e TiiQ) : ^\i\Q^{j,^,-j,) e (o,d)}| 



By Corollary 1 



we can write 

Gcid) = m\R- 



Ni{Q) 



31/1 

Since for d < the curve R{a,b) = ^^d does not intersect O, we have 
Gc{d) = 0. In particular, the distribution does not have any support at 

31/1 

0. For d > Gc{d) > 0. The distribution has another point of non- 
differentiability at d = 4-^, when the curve R{a,b) = intersects the 
bottom line a + b = 1. The picture of the distribution for \I\ = [0, 1) is given 
in Figure |4j reproduced from . In Figure [5| we display a picture of the 
region R~\ [c, d] ) . 



1.5.3. Indices of Farey fractions. Following [21j, we define the index of the 
Farey fraction 7, = |^ G J^iQ) by 
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Figure 5. The region R ^{[c d]) C 




In [8], this was reinterpreted in terms of the map T and the function k : 
17 —7- N. Recall that k{x, y) 



1+x 

y 



Precisely, we have 
1 



1 



Fix a (non-trivial) interval / C [0,1), and an exponent a £ (0,2). We 
consider the average 

1 



NiiQ) 



Applying Theorem 1.3 to (note that a G (0, 2) implies that G L^), we 



obtain a result originally due to Boca-Gologan-Zaharescu PU]: 

Corollary 1.10. As Q ^ oo, 

1 f °° 

TTTT^ E ^(7^r ^ / ^"d"i = 2EA;-area(J7fc). 

Here = K^^{k). 

1.5.4. Powers of denominators. We can also obtain general results on sums 



associated to Farey fractions by applying Theorem 1.3 to various functions 
in L^{^1, m). For example, if we define fs^t : — )• C by 

fs,t{a,b) = a%\ 

for s,t G C with 5}?s,3f?t > —1, and define 

Bs,t ■■= \\fs,t\\i = / fs,t{x,y)dm = 2 

JO. 



1 



r(g + i)r(t + i) 

(s + i)(t + i) r(s + t + 3) 

we obtain results originally due to Hall-Tanenbaum [23] . 
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Theorem 1.11. Fix a non-trivial interval I C [0, 1). Let s,t £ C. Then, if 
5Rs,Kt > -1, 

In particular, for s = 1 and t = 0, we have 
(1.14) lim 1^=1 xdm 



In addition, for s = — 1 and t = 0, 

(1.15) lim — ^ y ^ = / -dm = 2, 

and for s = t = —1, we have 

(1.16) lim 5^ = / — dm = — , 

Q->oo Ni{Q) qiQi+i Jn xy 3 

which yields the classical result 
1.5.5. Excursi ons. Finally, we remark that applying the equisitribution re- 



Ni{Q)^\I\^Q\ 



suit Theorem 1.3 to the functions M (and -^) defined in (1.8) above, we 
obtain an amusing statistical result on Farey fractions: 

Corollary 1.12. Let I C [0, 1) he a non-trivial interval. Then 
Q^^NiiQ) ^ Wn+i Q J S\ 

Q^^NiiQ) ^ VQ Q q^ + q^+lJ 3V 

1.6. Geometry of numbers. Let A G X2 be a unimodular lattice, and fix 
t > 0. Let 

St{A) = {Sl < S2 < ... < Sn < ■■■} 

denote the slopes of the lattice vectors in the vertical strip Vt CM? given by 

Vt:={{x,y):xeiO,t],y>0}, 

written in increasing order. Let 

G7v,t(A) = {sn+i -Sn:0<n<N} 

denote the sequence of gaps in this sequence, viewed as a set, so |GAr^t(A)| < 
A^. Our main geometry of numbers result states that for lattices A without 
vertical vectors, this sequences has the same limiting distribution as the gaps 
for Farey fractions, namely, Hall's distribution. That is: 
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Theorem 1.13. Suppose A does not have vertical vectors, and let < c < 
d < oo. Then 

lim hGN,tW n (c, d)\ = 2m{R-\c, d)). 

N^oo iV 

This result will follow from the application of the Birkhoff ergodic theorem 
to the orbit of A under the BCZ map, with observable given by the indicator 
function of the set R~^{c,d). If A does have a vertical vector, and thus a 
periodic orbit under the BCZ map, we have that for any t > 0, there is an 
A^'o = NQ{t) > so that Giy,t{^) = GNo,t{^) (as sets of numbers) for all 
A^ > A^o- We can use Theorem |1.3| to obtain the following: 



Corollary 1.14. LetO<c<d<oo. Then 

}}^J^\^mt),tW n ic,d)\ = 2m{R-\c,d)). 

2. Construction of Transversal 

In this section, we prove Theorem |1.1[ We first give an interpretation 
of the transversal 0, in terms of geometry of numbers in §2.1[ and prove 
Theorem 1.1 in ^ 2.1.1[ We also record some observations on slopes in ^2.1.2 



We show how can be interpreted in terms of hyperbolic geometry in ^2.2 



In ^2.3, we show a certain self-similarity property of the BCZ map T. 



2.1. Short horizontal vectors. Fix t > 0. We say a lattice A G X2 is 
t- horizontally short if it contains a non-zero horizontal vector v = (a, 0)"^ so 
that \a\ < t. Note that since A is a group, we can assume a > 0. We will 
call 1-horizontally short lattices simply horizontally short. We also have the 
analgous notions of t- vertically and vertically short. To prove Theorem |1.1[ 
we break it up into several lemmas. Our first lemma is: 

Lemma 2.1. 0, = {Aa^b ■ a,b & (0, 1], a + 6 > 1} is the set of horizontally 
short lattices. 

Proof. Since the horizontal vector (a, 0)"^ is in Aa^, and a < 1, clearly every 
lattice in Q is short. To show the reverse containment, suppose A is short. 
Then we can write A = Aa^' = Pa,b''^'^, with < a < 1, and 6' / 0. Let 
m G Z be such that 

1 — a < ma + b' < 1, 
i.e., m = [^^\ ■ Set b = ma + b'. Then, since pi^m G 5'L(2,Z), 



A = Aah' = Pa,b''^'^ = Pa,h'Pl,m7? = Aab = Aah G Q.. 



□ 



Next, we show that for any (a, 6) G ^2, that there is a s G (0,oo) so that 
hgAafi G and give a formula for the minimum s. 
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Lemma 2.2. Let {a,b) G il, so Aa^b S ^l. Let so = R{a,b) = Then 
/isqA G Q, and for every < s < sq, hgAa^ ^ il. Furthermore, 

Proof. Since consists of horizontally short lattices, we first note that the 
first time hgAa^ will have a horizontal vector is given by the equation 

-sb+^ = 0. 
a 

Thus we set sq = and a direct calculation shows 

fab 

nsoPa,b = y _1 Q 



-1 ^^(A(a,6n^ 



Let At(a, b) = L~5^J • Applying the matrix 

1 K{a,b) 
we obtain 

/oi\ /'/i^ ( ^ -a + K{a,b)b\ 

(2.1) hsoPa,b{A{a,b) ) =( q 5-1 )=l'T(a,b)- 

Thus, hsoAa^b = AT(a,b), as desired. □ 

Finally, we show that for any lattice A G X2 which is not vertically short, 
the orbit under {/is}s>o will intersect Q. 

Lemma 2.3. Let A G X2 so that A is not vertically short. Then there is a 
si G M so that /i^^A G $7. 

Proof. Let S = [—1,1]^ C M^. 5 is a square centered at 0, so is convex, 
centrally symmetric, and has area 4. Thus by the Minkowski convex body 
theorem, any lattice A must have a nonzero vector v G S". Since A is not 
vertically short, we can assume that v = (a, b)'^ is not vertical, that is, 
a 7^ 0. Further, we can assume that a > 0, otherwise we multiply by — 1. 
Let si = \. Then (0,0)"^ G /i^^A, so hg^A G Vl. 

□ 

2.1.1. Proof of Theorem \l . 1\ To prove Theorem |1.1[ we combine the above 
lemmas. Lemma |2.3| guarantees that all non- vertically short lattice horocy- 
cle orbits {/igAjsgK intersect ri, and Lemma |2.2| shows that if an /ig-orbit 
intersects Vt once, it must intersect O infinitely often (both forward and 
backward in time). To see that the set of intersection times is discrete, ob- 
serve that the roof function R{a,b) = ^ is bounded below by 1 on Q, so 
visits must be spread out at least time 1 apart. The calculation of the roof 
function R and the return map T are also given by Lemma |2.2[ 

Finally, we address the remark following Theorem |1.1[ A direct calcula- 
tion shows that if a lattice A has a vertical vector, then it is periodic under 
kg with period t"^, where t is the length of the shortest vertical vector. If A is 



16 



J. S. ATHREYA AND Y. CHEUNG 



vertically short, this means that it is periodic under hg of period < 1, so the 
/is-orbit of A cannot intersect We will see in §2.2| below that these orbits 
correspond to embedded closed horocycles, the family of which foliates the 
cusp of MV5L(2,Z). □ 

2.1.2. Observations on slopes. Given a unimodular lattice A, let 

{0 < Si < S2 < . . . < SAT . . .} 

denote the sequence of slopes of vectors in A n Vi, where 

Vi = {0 < X < l,y > 0} C 

is the vertical strip as in §1.6[ Then we see that si is in fact the first 
hitting time of of the positive orbit hgA, and the Sn are the subsequent 
hitting times, since is the set of horizontally short lattices. The slopes of 
vectors decrease by s under the horocycle flow hg (in particular, while hg 
does not preserve slopes, it preserves differences of slopes), and the BCZ 
map records them when they become horizontal, while also keeping track 
of their horizontal component (the coordinate a) and the next vector in the 
strip to become horizontal (the vector (6, a~^)^). The differences in slopes 
are the return times R, that is, for n > 1, 

(2.2) Sn+l-Sn = R{T''{hs,A)). 

This observation will be crucial for the proofs our results on the geometry 
of numbers, see ^ 

2.2. Hyperbolic geometry. We recall that the there is a natural identifi- 
cation of the the upper-half plane 

if = {z = X + iy: y > 0}. 

with the space of unimodular lattices with a horizontal vector, via 

z^^( I ^^Z2 = A. 



/y \ y J vv'Vv' 

That is, we identify z with the unimodular lattice homothetic to the lattice 
generated by 1 and z. This extends to an identification of all unimodular 
lattices with the unit-tangent bundle T^H^, where the point {z,i) (where i 
denotes the upward pointing tangent vector) is identified to the lattice with 
a horizontal vector, and the point (z, e*^i) is identified to the lattice rotated 
by angle 0. This identification is well-defined up to the action of SL(2, Z) 
by isometries on H^. In the standard fundamental domain for S'L(2,Z), 

X + iy £ : \z\ > 1, |x| < - 

the set of horizontally short lattices can be identified with the strip 

Ci := I z = X + iy e if : y > l,\x\ < ^ 
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Figure 6 . A hyperbolic picture of = Ci . The green circle 
is an /ig-orbit. Vertical lines are gt-orhits and horizontal lines 
are ttg-orbits. 




together with their upward pointing tangent vectors, see Figure [6j 

2.2.1. Geodesies and horocycles. The orbits of the one-parameter subgroups 



and 

TT ■= } 11. = I 

. 1 

the geodesic and opposite horocycle flows respectively, also have nice inter- 
pretations in terms of hyperbolic geometry. Under our choices, the action of 
{gt '■ t ^ 0} moves the upward pointing tangent vectors vertically upward in 
H^, and the flow {us} moves them horizontally. In particular, Q is preserved 
by the action of {gt : t < 0} and {ug}. In dynamical language, orbits of hg 
are leaves of the stable foliation for {gt : t < 0}, and {ug} are leaves of the 
strong unstable foliation. We are constructing a cross section of hg by taking 
a piece of the unstable foliation AU. That is, 0, C AU. 
In the Euclidean picture, direct calculations show that 

9t-^a,b = ^e-*/2a,e-*/2b and UgAafi = A^ fe+g^-i, 

so geodesic orbits are radial lines and opposite horocyclic orbits are vertical 
lines, see Figure [7j We will use these observations in crucial ways in ^2.3 
below. 

2.2.2. Identifications. Note that in the fundamental domain 

{z = X + iy : \z\ > 1, |x| < -} 
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Figure 7. Geodesic and opposite horocyclic orbits in the 
Euclidean picture of fi. 




for 5L(2,Z) acting on if, there is a natural identification of the vertical 
sides X = 2 and x = via the transformation 



which is a fractional linear transformation associated to the unipotent matrix 



In the Euclidean picture, we can also identify two boundaries of the region 
via the above unipotent matrix, namely the lines {x + y = 1 : < x < 1} 
and {y = l:0<x<l}. The resulting loop formed by the vertical segment 

{x = 1 : < y < 1} 

corresponds to the closed horocycle 

{z = l + iy :0<y <1} 

in the hyperbolic picture. Thus, the topology of 17 is that of a sphere with 
one puncture (corresponding to the point at oo in the hyperbolic picture 
and the point (0, 1) in the Euclidean picture) and one boundary component 
(corresponding to the loop described above). 

2.3. Self-similarity. The BCZ map has an extraordinary self-similarity 
property which can be seen naturally in both the hyperbolic and Euclidean 
pictures. Let t > 0, and let Qt denote the set of t-horizontally short lattices. 
Arguing as in Lemma [2. 1[ we have the identification 



z z + 1, 




(2.3) 



:= {(a, 6) € (0,t] : a + 6 > t}. 
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Appropriately modifying Lemma 2.3, we can define a return map (the t-BCZ 
map) 

which captures the /is-orbits of all lattices except those of t -vertically short 



lattices. Modifying the argument in Lemma 2.2, we have 

(2.4) Tt{x,y) = (y,-x+ — 

V I y . 

A direct calculation shows that the t-BCZ map is conjugate to the original 
(1-) BCZ map via the linear transformation Mf : ft ^ Qt given by 

Mt{a,b) = {ta,tb), 

that is 

(2.5) Tt o Mt = MtoT. 

Now assume t < 1 (if t > 1, the roles of t and 1 below should be reversed). 
Then the set of t-horizontally short lattices can also be identified with the 
subset $7^*) C fl given by 

J^W := {(x,y) en:x<t}. 

Let r(*) : fi^*) be the first return map of the BCZ map T to 

0^*^. Thus, T^*-* also represents the first return map of hg to the set of 
t-horizontally short lattices, and so T^*^ and Tt are conjugate. This conju- 
gacy can be made explicit via the map given by 

Lt(a,6) = (a,6-(l-t)), 

that is, 

TtoLt = LtoT^'\ 

In the hyperbolic picture, the set of t-horizontally short lattices can be 
identified with the subset 

Ct := l^z = X + iy elf : y > , \x\ < ^ 

Figures [8] and [9] show the Euclidean and hyperbolic pictures respectively. 
We can summarize this discussion with the following: 

Observation 1. T is self-similar in the following sense: For any < t < 1, 

the BCZ map T is conjugate to its own first-return map T^*) . 

Essentially, this self-similarity is a consequence of the following conjugation 
relation for gt and hg 

(2.6) gthsg-t = hg^-t, 

which in particular implies that the time-1 map hi and time-s map hg are 
conjugate for any s > (via the map ffiogs)- 
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Figure 8. fit and Q can be identified with Qt via the 
scaUng Mt, and fit and r^*^*-* via the vertical translation Lt. 



V = 1 



y = t 




Figure 9. A hyperbolic picture of Jit C 0, that is, Ct C Ci. 
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3. Ergodic properties of the BCZ map 



In this section, we prove Theorem 1.2, using Theorem 1.1 and well-known 
properties of suspension flows. We first recall these properties, which can 
be found, e.g., in [3T] . 

3.1. Suspension Flows. Let (X,^) be a measure space, and T : X ^ X 
a measure-preserving bijection. Let R : X ^ R+ be in L^{X,n). The 
suspension flow over T with roof function R is defined on the space 

(3.1) X^ ■.= {{x,t):xeX,te [0, R{x))}/ ~ 
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where {x,R{x)) ~ (r(x),0). The suspension flow (f>^''^ is given by 

</.f'^(x,t) = {x,s + t). 

The measure dvR;r = is a (/)^'-^-invariant probability measure on 

X^. It is ergodic if and only if /x is T-ergodic.The natural dual construction 
to suspension flow is the construction of a first return map for a flow. Given 
a measure- preserving flow (j) : {Y, v) — > (Y", v), and a subset X C y so that 
for almost all y G y, {t G M : (/>*y G X} is discrete, we define the first return 
map T -.X ^ Xhy T{x) = </.^(^)(x), where 

R{x) = min{t > : E X} 

is the first return map. There is a natural (possibly infinite) T-invariant 
measure /i on X so that R G L^{X,^). and the suspension flow 0^'-^ : 
[X^jVji^t) — ^ {X^,'^b.,t) is naturally isomorphic to the original flow (/>. 
There is also a map between the sets of invariant measures for the flow (j)^''^ 
and the map T, whose properties are summarized in the following: 

Lemma 3.1. Let {X,fi) be a measure space, and let T : (X, ^) — t- {X,fj,) 
be a fi-preserving bisection. Let R be a positive function in L^{X,fi). The 
map rj i — > riR,T, with drfji^T = jj^^dr]dt is a bisection between the set of 

T-invariant measures rj on X so that R G L^{X,rf), and the set of 4>^'^ 
invariant probability measures on X^ (and thus also between the ergodic 
invariant measures). Moreover, we have Abramov's formula relating the 
entropy of the flow (j)^''^ (that is, of its time 1-map) to the entropy of the 
map T: 



(3.2) h. 



\\'^\\l,r! 



3.2. Ergodicity. In this section, we prove the ergodicity and measure- 



classification parts of Theorem 1.2 The ergodicity of the BCZ map with 
respect to dm = 2dadb follows from 

Ergodicity: of the horocycle flow with respect to Haar measure /i2 on 

Suspension: The measure 2dadbds on X2 (viewing it as the suspen- 
sion space over Q) is an absolutely continuous invariant measure for 
{hs}. 

Measure classification: By Dani's measure classification [TS], this 
must be (a constant multiple of) ^^2. 

To show uniqueness, suppose v was another ergodic T invariant measure on 
n. Then it must be supported on a periodic orbit for T, since the measure 
dvds on X2 is a non-Haar ergodic invariant measure for {hg} and thus, 
by Dani's measure classification, must be supported on a periodic orbit for 
{hs}. □ 
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3.3. Entropy. We now prove the fact that the entropy hm{T) of T with 
respect to the Lebesgue probabihty measure m is 0, completing the proof of 



Theorem 1.2 We use Lemma 3.1), which states that the entropy hm{T) of 
the BCZ map with respect to m is proportional to the entropy of the horo- 
cycle flow h^^{hi) with respect to Haar measure. To show that hf^^i^i) — 0) 
we record a fact which seems to be well-known but does not have a complete 
proof in the literature as far as we are aware: 

Theorem 3.2. Let F C S'L(2,M) he a lattice. Let ^2 denote the finite 
measure on S'L(2,]R)/r induced by Haar measure. Then 

^^2(^1) = 0- 

Ln fact, the topological entropy 

htopihi) = 0. 

Proof. When F is a uniform lattice, that is, the quotient H^/F is compact, 
the second assertion is a theorem of Gurevic [19j, and together with the 
standard variational principle (see, e.g., Walters |35j). which states that 
the topological entropy is the supremum of the measure-theoretic entropies. 
When F is non-uniform, we can use a version of the variational principle 
for locally compact spaces due to Handel-Kitchens [23 , to again reduce the 
measure-theoretic statement to a topological statement. Since the horocycle 
flow is C°°, we can apply a theorem of Bowen [12J to conclude that 

htop{hi) < 00. 

On the other hand, since kg is conjugate to hi for any s > 0, we have 

htopihi) = htopQis) = shtopihi), 
which implies that htop{hi) = 0. □ 

4. Structure of Periodic Orbits 



In this section we prove Theorems 1.4 and 1.5 and describe in detail the 



relationship between periodic orbits for T and hs. We first record (in ^4.1) 



our geometric proof of the key observation (1.5) that certain periodic orbits 



of the BCZ map parameterize Farey fractions. 

4.1. Farey fractions and lattices. In this section, we give a geometric 
proof the following 

Lemma 4.1. Let Q he a positive integer, and let 

1 p- 1 
^{Q) = {7 = 71 < 72 = 7^ < • • • < 7i = - < • • • < 7iv = -}, 

1 Q qi 1 



N = N{Q) = ^q<:Q fiq) denote the Farey sequence. Then (1.5) holds, that 
is. 



Q J \Q Q 
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Remark: Here, i is interepreted cyclically (i.e., in Z/NZ). In particular, 
this is a periodic orbit for T (and for hs). Note also that the return time 
gives (normalized) gaps between the Farey fractions, that is, 



'i+i - 7ij- 

Proof. Geometrically, the sequence T{Q) correspond to the slopes of primi- 
tive integer vectors ( | in the (closed) triangle Tq with vertices at (0, 0), 
\ Pi J 

{Q,0), and {Q,Q)- Now note that since contains a vertical vector, it is 
periodic under hg, with period 1. Using the conjugation relation (2.6), we 
have that the lattice gtZ'^ has period e~* under hs, since 

Note that gt scales slopes (and their differences) by e~*. For Q &'H, setting 
tq = — 21nQ, consider the lattice 



2 



Note that we can also write this as 



Q J \ Q J \ 1 J \ Q 

This lattice corresponds to the point {Q^^, 1) in the Farey triangle Q,. By 
our earlier observation, this has period under hg- We are interested in 
the behavior of the orbit of this point under T. Recall that the BCZ map 
only 'sees' primitive vectors with horizontal component less than 1 that is, 
in the strip V = Vi, since hg does not change the horizontal component of 
vectors. Originally we were interested in Farey fractions of level Q, that 
is, the slopes of (primitive) integer vectors in the region Tq. Applying the 
matrix gtg, the region Tq is transformed into a subset V. Thus, the vectors 

^ G Tq correspond to vectors ^ ^Qp^^ ^ ^ dtQ^^Q Therefore, 

they are seen by the BCZ map. Precisely, gtqTq is the triangle with vertices 
at (0,0), (1,0), and (1,(5^). See FigurefTol 



As noted above, the (primitive) vectors ( * j G gtqZ'^ will be seen by 

the BCZ map in the order of their slopes (that is, at time (5^7j), and the 
time in between the i^^ and (i + 1)'^* vectors will precisely be the difference 
in their slopes 

Q^{ii+i - li)- 

( \ 

At time we are seeing the horizontal vector ( q 1 (corresponding to 
7o), and the next vector to get short is ( q j (corresponding to 71), which 
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Figure 10. The triangles T4 and gt^T4, with lattice points. 
Primitive lattice points in blue, others are in red . 

(1,16) 




will become horizontal in time 

R{Q-\ 1) = Q = q2 (^1 - 0) = Q\^i - 70). 

More generally, we have that T'^{Q~^, 1) corresponds to the lattice hQ2^^gtQlP' 
and noting 

^ f Qi Qi+i \ _ f Q'^Qi Q~^qi+i 



we obtain, as desired 

r{Q-\i 

and 



Q' Q J' 

R{T\Q-\l)) = Q\^,+^-^,). 
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In particular, the point (Q^^, 1) is T-periodic with period N{Q). Also note 
that 

N(Q)-1 

^ RiriQ-\l)) = Q^ 

1=0 

and thus, the sum of the return times over the discrete periodic orbit corre- 
spond, as they must, to the continuous period. □ 



4.2. Proof of Theorem 1.4, We first note that any periodic point (a, b) G 
Q for T corresponds to a periodic point Pa^b^"^ £ ^ for hs on X2 and vice- 
versa. Precisely, suppose (a, 6) G is periodic under T, and n = P{a, b) > 
is the minimal period, so 

T^{a,b) = {a,b) 
and T^{a, b) / (a, b) for j < n. Then 

where 

n-l 

s(a,6) = j;i?(r(a,5)). 

i=0 

Note that the points (a, a) € have period P{a, a) = 1, and that a direct 
calculation shows that s(a, a) = R{a,a) = 

Periodic orbits for hg occur in natural families, due the conjugation rela- 
tion (2.6). In particular, if A is periodic with period sq, 

(4.1) /i^ge-tgjA = gthsoA = gtA, 

so the flow period of gtg'^'^ is soe~*. We have 

gtAa,b = Agt/2^_gt/2j,, 

so we see that if (a, b) is periodic for T, so is the entire line segment 

1 



(ta,tb),l> t> 
^ ^ a + b 

Thus, to analyze which points have periodic orbits, it suffices to consider 
points of the form (1,6) or (a, 1). Since r(a, 1) = (1,1 — a), we consider 
points of the first kind, or, equivalently, lattices of the form Ai^;,. This is a 
periodic point under hg if and only if there exists an sq so that hsQ^i^b = Ai^b, 
or, equivalently, 

Pi,bhsoPi,b G SL{2,Z). 
A direct calculation shows that this is impossible if y is irrational, and if 
y = J the minimal such sq = and we have 

-I7 f 1+kl k"^ \ 

(4.2) P,,bKPi,t=[ _;2 )■ 

This proves Theorem 1.4, since we have shown that any point with rational 
slope is periodic. □ 
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4.3. Proof of Theorem 1.5 As above, we consider the point {l,b). If 
b=j,k<lGf^ relatively prime, then the period under hg is P. The line 
segment associated to this point is 



k\ / I 



Applying (2.6) to p^^klj , we can calculate the period 



s{t,t 



t2- 



To calculate the discrete period, we analyze the scalings of the Farey periodic 
orbits 



1 



1 



N{Q)-1 
i=0 



using the following proposition, a special case of Theorem 1.5 which we will 
use to prove the general result: 



Proposition 4.2. For t G , 1 



P t 



Q 



NiQ). 



This will follow from showing that T in fact is linear along the orbit of 
the segment |(^^) ^) • * ^ ^^^^l |) that is, the segment does not 'break 
up' into pieces. Precisely, we have: 



Lemma 4.3. For t G 1 , 1 < i < N{Q), 



K T t, 



Q 



K T 1 



Proof. Since TM 1, ^ j = f |, ^ j , we need to show, for t G f ^> 1 



qi Qi+i \ ( iQi tQi+i 



Q' Q 



g' Q 
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We have 
(4.3) 



tqi tq, 



'i+i 



Q' Q 



< 

< 
< 



2 + 0- 

■- + Qi 



Qi+i 
Q + l+Qi 
Qi+l 

Qi+i + qi+2 + Qi 



1 + 



Qi+i 
Q + Qi 



qi+i 



where in the last hne we are using the identity 



Qi + qi+2 



Q + qi 



li+i 



On the other hand, 



Thus, 



Qi+l ~ Qi+l ~ 

tqi tqi+i 



Q + qi 




Qi+l 





so 



Q' Q 

Q + q. 



Q + qi 

qi+l 

< u 



qi+l 



as desired. 

Now Proposition 



4.2 



tjU tqi+i 
Q' Q 

follows from P ( 1 , 4 



Q + qi 

qi+l 

(h qi+l 
.Q' Q 

N{Q). 



□ 
□ 



To prove Theorem 1.5, we observe that the point (l, j) is contained in the 
periodic orbit corresponding to the point (7,1), since we can find consecutive 
Farey fractions 71 = ^- and 7^+1 = in (we are assuming that 

gcd(A;,/) = 1). By Proposition 4.2, for a G (j:p3-, 1], 

p(a,aj^ =N{1). 



If we put a = , we have the point ^ , j , which is contained in the 
periodic orbi 
Proposition 



periodic orbit of the point yj^, Ij , by similar reasoning as above. Applying 



4.2 



to this point, we have that for b G yj^i 1 
bl bk 



iV(/ + l). 
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Rewriting, we have that for a G ^ 

k 



P[t,t 



N{l + l) 



Continuing to reason in this fashion, we have that for 1 < r < k, t E 

I I 



l+r^ l+r-1 



k 



P tA- 



N{l + r- 1). 



Finally, we need to calculate An (t, ij) , where n = P ( t, tj) . Letting sq 



s [a 



>4) 



I, we have, by a similar calculation to (4.2) that 



On the other hand, ( |2.1[ ) can be re-written as the conjugation 

PT{a,b)^Rici,b)Pa,b = A{a,bf. 
Iterating, we obtain, for any m > 0, (a, b) £ $7, 

PT^{a,b)^R{a,b)Pa,b = ^m(a, 6)^- 

Applying this to (a, h) = {t, tj) and m = n, we have 



l + kl 
-P 1 - kl 



{An{a,b)-1) 



l + kl k^ 
-P 1 - kl 



since T" (t, ij) = Inverting and taking transposes, we obtain, as 

desired 



1-kl f 

1 + kl 



completing the proof. 



□ 



4.4. Segments and the shearing matrix. Note that the matrix 



A 



P(a,af) 



k 



a, a- 



1 - kl 



l + kl 



is parabolic, and fixes the segment | (a, a j) : a £ (^j^, 1 | . In fact, it shears 
along this segment in the following fashion. Note that 



k-^ + P 



I k 
-k I 



1-kl P 
-k"^ 1 + kl 



I 



1 P + k^ 
1 



Thus for points (a, b) sufficiently near the segment, they will be sheared along 
the segment under the appropriate power of T (which may vary depending 
on the period of the subsegment {a,b) is close to). As it moves along the 
segment, the period will change. This is illustrated in Figure 11, which 
shows the periodic segment { (a, a|) : a E (f 1 1] }) which breaks up into two 
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Figure 11. The periodic segment { (a, a|) : a G (|, l] }. 
The segment a G (|, l) is in blue and the segment a € (|, |] 
is in red. The labels next to the segments correspond to the 
power of T. The red segment has period A^(4) = 6, and the 
blue segment has period A^(3) = 4. 





(0,0) 



(1,0) 



pieces, one of period 5 and one of period 6. The shearing matrix for this 
line is given by 

-5 9 
-4 7 

which is conjugate to 

' 1 13 
1 



4.5. Hierarchies of periodic orbits. Theorem 1.5 has a nice geomet- 
ric explanation, which we describe informally: If we start with the points 
(a, a) G i7 of period 1 under T, and move down the line, we hit (5,5), which 

is not in f2. However, we can identify Ai 1 with Ai , , since, in general, we 

2 ' 2 2 ' 

have 

Aa,l-a = Aa_i. 

The orbit of (^, l) has period iV(2) = 2, and continuing to move down the 



line. Proposition 4.2 shows that for a G (2/3, 1], 



Pf",a) =iV(2). 



,2' 

At a = |, we identify |) with the point l) , which has period A^(3) = 
5. Continuing this process, we obtain all the Farey periodic orbits f 
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and the scahng segments 



t \ ( Q 



The set of T-periodic points in Q is the union 



U U 1^' 



t \ f Q 



Q J VQ + i 



4.6. Hyperbolic Geometry of Periodic Orbits. What we have done in 
X2 is to push the cuspidal horocycle correpsonding to h.a,a, a > 1 down 
using Qt. For a > 1, the lattice A^^a is vertically short, and the hs trajectory 
is an embedded loop in X2 which does not intersect 0. For 1 > a > ^, 
the /i^-orbit it intersects our transversal once, on the point A^^a, which is 
a fixed point for the BCZ map T. As we push it down, the hg trajectory 
intersects the transversal more and more times. However, it does not simply 
pick up one intersection at a time, but rather N{Q + 1) — N{Q) = ip{Q + 1) 

intersections when we transition from (^g^i Cj+i) (qTT' ^ 
5. Equidistribution of periodic orbits 



In this section, we prove Theorem 1.3, which is the key result for our num- 



ber theoretic applications. This is essentially a direct consequence of the 
well-known equidistribution principle for closed horocycles on X2, originally 
due to Sarnak [32] and reproved by Eskin-McMullen |16] using ergodic the- 
oretic techniques. In our notation, and in terms of the map T, this result 
can be reformulated as follows. Recall that given a (non-empty) interval 
/ = [q,/3] C [0,1], and Q > 1, we defined pgj as the probability mea- 
sure supported on (a long piece of) the orbit of (^,1)- That is, setting 
Ni{Q) := |J^(Q) n/|, we had 

Define the measure pq j on X2 by setting 

dpQ,i = dpQ,idt, 
where we are identifying X2 with the suspension space over Q. 
Theorem 5.1. /I32],[l6],[25]y As Q 00, 

PQ,I /^2, 

where the convergence is taken in the weak-* topology. 

Remark. In [32j and [16J the result is only established for the case a = 
0,(3 = 1, although it is clear that the same arguments in |16j work for the 
case of fixed 0<a</3<l. A proof of this was outlined in [25]. Stronger 
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results where the difference /3 — a is permitted to tend to zero with Q have 
been obtained by Hejhal [26] and Strombergsson 



Let TT be the projection map from the suspension onto il. Note that vr 
is continuous except for a set of measure zero with respect to the measures 
Pq j and fi2- Thus, we have as Q — )• oo, 

1 R 1 

which completes the proof of Theorem 3.1. 

5.1. Farey Statistics. Our results on the statistics of Farey fractions are 



all immediate corollaries of Theorem 1.3, being statements of the form 

(5.1) / GdpQj^ [ Gdm 

Jn Jn 

for appropriate choices of functions G on Vt. In the applications to spac- 
ings and /i-spacings, the function G is in fact an indicator function, and so 
the convergence is immediate. In the setting of indices and moments, the 
functions G are L}{rn) functions, we need the following lemma. 

Lemma 5.2. For Q G N, set mq = pqj and let rrioo = rn. Let W he the 
set of measurable functions G : — )• M satisfying 



(5.2) l|G||vi/ := sup / \G\dmQ < oo 

QeNu{oo} Jn 

where two functions in W are identified if their difference has zero norm. 
Let Co(ri) he the space of continuous functions vanishing at infinity equipped 
with the uniform norm and let Wq he the closure of its image in W . Then 



(5.1) holds for any G G Wq. 



Proof. Let r/ : Cq{Q) W he the natural inclusion map, which we note 
is a continuous injection that fails to be an embedding. It induces a map 
r/* : W* — ^ M{Vt) between the dual W* with the weak-* topology and the 
space M(J7) of Radon measures on Vt. Each mq determines a linear map 
r/(Co(ri)) — )• M that is readily seen to be continuous even with the subspace 
topology on r/(Co(l^)) coming from W . This linear map extends uniquely to 
a linear map Wq — >• M of norm at most one. Given e > and G G Wq, there 
is sequence G^ G Cq{^Y) converging to G. Choose k so that \\Gk — G\\ < e/3, 
then choose Qq such that for all Q > Qq we have \mQ{Gk) — m{Gk)\ < e/3. 
The triangle inequality now gives 

ImgiG) - m{G)\ < 2\\G - Gk\\ + \mQ{Gk) - m{Gk)\ < e. 

This shows that (|5.1|) holds. □ 



Below, we indicate which functions go with which corollaries, and show how 
to verify the condition (5.2) in these settings. 
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5.1.1. Spacings and h-spacings. Corollary 1.8 follows from applying (|5.1|) 
to the indicator function of the set R^^ 



Similarly, Corol- 



lary 1.9 follows from applying (5.1) to the indic ator function of the set 
R^^{B). Since indicator functions are bounded, (5.2) holds immediately. 

□ 

5.1.2. Indices. To verify ( |5.2[ ) for k", we need: 

Lemma 5.3 ([9j, Lemma 3.4). Let r G N, n > 4r + 2. Suppose n{a, b) = n. 
Then 

(1) Fori = ±1, K{T^a,b)) = 1 

(2) For l<\i\< r, K{T'{a, b)) = 2 

That is, any large value n of k must be followed by roughly n/4 small values. 
Furthermore, by the definition of k, we can see that the maximum of k along 
the support of mq is at most 2Q, which is on the order of the square root 
of the length of the orbit Nj{Q), since 



(U <li+i 
Q' Q 



gi+i 



Q + qi 



li+i 



Combining these two facts, we have that there is a constant c = C/j, depend- 
ing on the interval / and the power so that for any < /? < 2, and any 
Q G N, 

mQ(K > iV) < cpN-^. 

Let < a < /3 < 2. Then 



Af=l 



Since m{n = k) 



N=l 

fc(fc+i)(fc+2) A; > 2, we have 
m(K°) < 8C(3-a), 



so we have verified (5.2) for k", proving Corollary 1.10 



□ 



5.1.3. Moments. Finally, to obtain Theorem 1.11, we apply (5.1) to the 
function 

Gs,t{a,b) = a'b\ 

t, s G C, Rs, > —1. To verify (5.2), we use that for (a, b) G Q, 

\Gs,t{a,b)\ < |G_i,_i(a,6)|, 

so it suffices to verify the condition in the case {s,t) = (—1, —1) . We need 
the following: 

Lemma 5.4. For any interval I C [0, 1) there is a constant ci such that for 
any Q with Nj(Q) > 0, we have 

NiiQ) > ciQ2. 
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Proof. Ni{Q) counts the number of primitive lattice points in the triangle: 



which has area jQ'^l/]- There are two cases. If Nq/2{I) > 1, then Theorem 3 
of [131 (with the sup norm) implies 

On the other hand, if Nq/2{I) = 0, the length of I is bounded above by the 
largest gap in Fq/2, which is at most 2/Q, so that 

Nq{I) > 1 > 

Thus, the lemma follows with the choice 

■ [E 



ci 



mm 



VlSvr' 4 



□ 



We have 



"iq(G_i _i) 



NiiQ) ^ qm+i 



\i\Q^ 

NiiQY 



Applying Lemma 5.4, we have that mQ(G_i^i) is unfiormly bounded in Q, 



and we have that m(G_i^i) 



completing the proof. 



□ 



5.1.4. Excursions. To prove Corollary 1.12 we apply (5.1) to the functions 

1 



Gmax(a,&) = max<^ a,b, 



a + b 



and 



Gmin(a, 6) = min 



1 1 

a 



a + b 



respectively. Equation (5.2) is satisfied since Gmax is bounded by 1 and G 
is bounded by 2. 

6. PlECEWISE LINEAR DESCRIPTION OF HOROCYCLES 



mm 

□ 



Before proving Theorems 1.6 and 1.7, we record an elementary lemma on 
the behavior of the length of vectors under hg , whose proof we leave to the 
reader. 

Lemma 6.1. Let v = {x,y)'^ G 6e a vector with x > 0. Let = 



a lb X 



where a 



-, is the slope of v. Then /(s) 



|^sv||sup is the 



continuous piecewise linear function given by 

(i) f'{s) = -\x\ for s < a- , 

(ii) /(s) = \x\ for a~ < s < , and 
(ii) /'(s) = for s > cr+. 
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6.1. Minima and maxima. Recall that Lemma l2 . 1 1 st at es that i7 consists 
of lattices with a short (length < 1) horizontal vector. This observation 
shows that the visits of the trajectory {hsA}s>o to the transversal O are 
precisely the times Sn(A) of local minima of iA{s), since a lattice cannot 
contain more than 1 vector of supremum norm at most 1, and under hs, 
vectors are shorter when they are horizontal. On $7, we have 

i {Aa,b) = a. 

By abuse of notation we write £ : 17 — t- (0, 1] with 

i{a, h) = a. 

Similarly we write a{a,h) = ^. Thus when /is„A = A^^ j,^, the length of the 
shortest vector in /is„A is given by the horizontal vector ^ ^ • Setting 

si(A) = min{s > : /i^A G Vt}, 
and writing Aa^^b^ = /isjA, we have that 

N N 

n=l n=l 

and 

AT N 

71=1 n=l 

Since both a and ^ are in L^{Q,dm), and we have assumed that A is not 
periodic under hs (and so, (ai, 6i) is not under T), we can apply the Birkhoff 
Ergodic Theorem (and the fact that all non-periodic orbits are generic for 
m, by the measure classification result) to conclude 

N N r 2 

A- N E = A- N E ' ^^)) = y. ''"^ = 3 • 

n=l n=l -^^^ 

and 

n=l n=l -^^^ 



proving Theorem 1.6, □ 



6.2. Returns. For the corresponding results for local maxima 5n(A), we 
need to consider the 'hand-off' between the basis vectors (a, 0)^ and (6, a~^)^ 
which happens after hitting the point A^ G Q. This hand-off happens when 
the vectors /is(a,0)^ = {a,—sa)'^ and hs{b,a~^Y' = {b, —sb + a"^)^ have 
the same length. That is, we need to calculate the time s £ {0, R{a,b)) so 
that 

max(a, sa) = max(6, —sb + a~^), 
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Figure 12. The hand-off between the vectors (a, —sb)'^ and 
{b,-sa + b-^f. 




c(a, sa) 



max(6, —sb + a "'") 



(0,0) 



{a-^b-^ - 1) a-^b- 



where we have taken absolute values of the coordinates (see Figure 12). A 
case-by-case analysis shows that the length of the vector at the hand-off 

is given by f{x,y) = max | a, 6, j^qz^j- By a similar analysis as above, we 

apply the ergodic theorem to this function (and to its reciprocal) to obtain 
Theorem ll.7[ □ 

7. Geometry of Numbers 



In this section we prove Theorem 1.13 and Corollary 1.14 



7.1. Slope gap distribution. We will prove Theorem] 1.13 by noting that 



the observations of ^2.1.2 yield the following: 



Lemma 7.1. Let t > 0, and let Tt denote the t-BCZ map. Let A denote 
a lattice without t- short vertical vectors, and let Ai = /i^^A he the first 



intersection of the orbit {/i<jA}s>o with the transversal fit- 
oo. Let Xcd denote the indicator function of the set 



LetO < c< d< 



R^^{c, d) 



{x,y) e^t-.^ <xy <- 
a c 



Then 



1 

N 



1 ^ 

|G^,(A)n(c,d)| = -^Xc,d(Tr(Ai)). 



i=0 



Proof. The lemma follows from the fact that hg preserves differences in 
slopes, and the function R captures these differences, for vectors in the 
strip Vj. In particular, 

RiTl'iAi)) = Sn+l - Sn, 

which yields the lemma. □ 



The proof of the theorem then follows as in ^6.1, by applying the Birkhoff 
ergodic theorem and Dani's measure classification. □ 
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7.2. Slope gap statistics. To prove Corollary 1.14, we first observe that 
if A has a vertical vector, its periodicity under hg implies that the sequence 
of slope gaps in Vt must repeat, so in particular there is an A'o = No{t) so 
that GN,t{^) = GNo,ti^) for all N > Nq. For example, in ^.1 we see that 
for the lattice Z^, Nq^I) = N{[t\), where N{Q) denotes the cardinality of 
the Farey sequence. Again using the conj uga tion relation (2.6), we see that 
GN,t{^) = G]\f,i{g-2\ogt^)- Using Lemma 7.1, we see that -^\G]\i^t{^)(~^{c,d)\ 
now corresponds to the integral of Xc,d along a periodic orbit for T, which, 
as t — )• oo, is getting longer and longer. By Theorem 1.3, this converges to 
the integral of Xc d with respect to Lebesgue measure m, as desired. □ 



8. Further Questions 

8.1. BCZ maps for other lattices. It was communicated to us by O. Sarig 
that it is well-known that the horocycle flow on SL(2,M)/T for any lattice F 
can be realized as a suspension flow over an adic transformation. A natural 
conjecture, then is: 

Conjecture. The BCZ map is an adic transformation. 

To our knowledge, the BCZ map is the first explicit description of a return 
map. We have the following immediate 

Question. How explicitly can the return maps for kg on 5L(2,M)/r for 
F / SL{2,Z) be given? 

The first author, in joint work with J. Chaika and S. Lelievre [3], has 
calculated a BCZ-type map for F = A(2, 5, oo), the (2, 5, oo)-Hecke triangle 
group (note that SL{2,Z) is the (2,3,oo) triangle group). 

8.2. Slope gaps for translation surfaces. The motivation for [3] was to 

generalize the calculation of the gap distribution for Farey sequences, which 
correspond to closed trajectories for geodesic flow on the torus, to a par- 
ticular example of a higher-genus translation surface, given by a particular 
L-shaped polygon known as the golden L. A more general question would 
be: 

Question. Is there a BG Z -type map for the space of translation surfaces. 
Can it he used to explicitly compute gap distributions for saddle connection 
directions. 

8.3. Mixing properties. While the horocycle flow hs is known to be mix- 
ing on X2 with respect to the measure /i2 (by, e.g., the Howe-Moore theo- 
rem [6j), mixing is a property that does not pass between flows and return 
maps. In fact, there are many well-known constructions of mixing suspen- 
sion flows over non-mixing base transformations. Thus, we have the natural 

Question. Is the BCZ map T mixing? 
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8.4. Rates of convergence. Flaminio-Forni |17j and Strombergsson 
have proved essentially optimal results for the deviation of ergodic averages 
for hs on X2. Again, precise control of deviations does not pass to the base 
transformation, so we have the following: 

Question. Can one give bounds on the deviation of ergodic averages for the 
BCZ map. 

Zagier [36j showed that proving an optimal rate of equidistribution for long 
periodic trajectories for hg on 5^(2, M)/S'L(2, Z) (that is, an optimal error 
term in Sarnak's theorem |32j ) is equivalent to the classical Riemann hy- 
pothesis. There is also an equivalent formulation of the Riemann hypothesis 
in terms of the distribution of the Farey sequence due to Franel-Landau [18j . 
This leads to 



Question. Is there an optimum hound on the error term in Theorem 1.3 
that is equivalent to the Riemann hypothesis? 
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